Random Ancestor Trees 



O 

(N 

Ph. 
< 
O 



I 

o 



> 
o 

On 
O 

o 



X 




E. Ben-Nairn^ and P. L. Krapivsky^ 

^ Theoretical Division and Center for Nonlinear Studies, 
Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA 
^Department of Physics, Boston University, Boston, Massachusetts 02215, USA 

We investigate a network growth model in which the genealogy controls the evolution. In this 
model, a new node selects a random target node and links either to this target node, or to its 
parent, or to its grandparent, etc; all nodes from the target node to its most ancient ancestor are 
equiprobable destinations. The emerging random ancestor tree is very shallow: the fraction <;„ of 
nodes at distance n from the root decreases super-exponentially with n, g„ — e~^/(n — 1)!. We find 
that a macroscopic hub at the root coexists with highly connected nodes at higher generations. The 
maximal degree of a node at the nth generation grows algebraically as N^^^" where A'^ is the system 
size. We obtain the series of nontrivial exponents which are roots of transcendental equations: 
Pi = 1.351746, /?2 — 1.682201, etc. As a consequence, the fraction pk of nodes with degree k has 
algebraic tail, pk ~ fc^^, with -y = Pi + 1 — 2.351746. 

PACS numbers: 89.75.Hc, 02.10.Ox, 05.40.-a, 02.50.Ey 



I. INTRODUCTION 

The interplay between dynamics, structure, and func- 
tion of complex networks is the subject of intense current 
research A wide spectrum of social, technical, and 

biological networks have broad degree distributions with 
a power- law tail [1, Q. Further, many real- life networks 
also have macroscopic hubs that are connected to a finite 
fraction of all nodes and these hubs play important func- 
tion in the network 043- In this study, we introduce a 
minimal model that captures both of these features. 

Common mechanisms of network growth include pref- 
erential attachment copying, and re-direction [lol - 
In these processes, a target node is identified and the 
attachment is either to the target node itself or to one of 
its immediate ancestors. These growth processes result 
in heterogeneous networks with broad degree distribu- 
tions. However, there are no macroscopic hubs because 
the maximal degree in these networks grows sub-linearly 
with the network size. 

In this study, we consider the complementary situa- 
tion where attachment to any ancestor is possible. We 
show that the resulting network has a fundamentally dif- 
ferent structure: there is coexistence between a hub with 
a macroscopic degree and highly connected nodes with 
sub-macroscopic degrees. Moreover, there is a series of 
exponents that characterize how the maximal degree at 
a given distance from the root scales with system size. 

To model situations where the genealogy controls the 
network growth, we start with a single node, the root, 
and add nodes sequentially, so that each new node at- 
taches to an existing node through a two-step process 
(see Fig.[T]): 

• The new node selects a target node at random. 

• The new node attaches to one node, selected at ran- 
dom from the set of nodes that includes the target 
node and all its ancestors. 
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FIG. 1: The random ancestor tree. A new node attaches, 
with equal probabilities, to one of three nodes: the target 
node (marked by x), its parent, or its grandparent. 



Since each new node adds a single link, the growing net- 
work retains a tree structure, and apart from the root, 
each node has a single parent node. We term the result- 
ing structure the random ancestor tree. An appealing 
feature of this model is the absence of control parame- 
ters. 

Our motivation for this model comes from social, schol- 



arly, and reference networks For example, the pub- 
lished literature in science, law, and religion grows by 
sequential addition of material. Starting with some refer- 
ence, a scholar typically studies an entire line of preceding 
reports or rulings, and refers to the most appropriate or 
the most important pub lished literature, not necessarily 
the most recent one [15l - [l7| . In this sense, the evolution 
of such reference networks is coupled to their genealogy. 

We find that the random ancestor tree has a single 
macroscopic hub, namely the root, that is connected to 
a fraction e^^ of all nodes. Furthermore, the degree dis- 
tribution is broad as the fraction pk of nodes with degree 
k decays algebraically. 
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7 = 2.351746, 



(1) 



when k oo. Interestingly, the exponent 7 is obtained 
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as a root of a transcendental equation. We also obtain a 
series of nontrivial exponents 7„ , each characterizing the 
degree distribution of nodes at the nth generation. As n 
increases, the exponents 7„ increase, so that the nodes 
become less and less connected as their distance from the 
root increases. 

Throughout this paper, we focus on the asymptotic 
behavior of large trees. We first analyze the genealogi- 
cal structure of the random ancestor tree in section HIl 
The depth distribution indicates that the root is a macro- 
scopic hub, and in section IIIII we show that the root is 
the only such hub. We then establish (Sec. irv|) the de- 
gree of the most connected nodes at higher generations 
and discuss the implications of these results for the de- 
gree distribution (Sec. |V|. Other structural properties 
including the number of leaves, and the likelihood that 
the tree has star or chain topology are derived in section 
IVIl Conclusions are given in section IVlIl 



to the nth generation, 



Gn{N)=gnN. 



(4) 



The coefhcients gn are normalized, X)n>i5" ~ 1- Fur- 
ther, since Go — 1 we have go = in the limit N — s- oo. 
Equation Q shows that the fractions gn satisfy 



9n 



(5) 



It is convenient to re-write Eq. ([5]) as the explicit recur- 
sion 



gn 



(6) 



By evaluating the first few terms, g2 = gi, gs = 5i/2!, 
54 = 5i/3!, and by induction, gn = gi/(n— 1)!. The nor- 



malization condition J2r. 



gn 



1 sets gi 



Thus, 



II. GENEALOGICAL STRUCTURE 

We treat the random ancestor tree as a genealogical 
tree and group nodes according to generation (figure [l} . 
The first generation consists of nodes at distance one 
from the root, the second generation includes nodes at 
distance two from the root, etc. In principle, the number 
of nodes in each generation fluctuates from one realiza- 
tion to another, yet in the thermodynamic limit such 
fluctuations are negligible, and we therefore study the 
average number of nodes in each generation. 

Let N be the total number of nodes in the tree and let 
Gn (N) be the average number of nodes at distance n from 
the root. Only the root belongs to the zeroth generation 
and therefore, Go{N) = 1. The average number of nodes 
in the flrst generation, Gi{N), grows according to the 
rate equation 



dGi Gq Gi G2 



(2) 



Hereinafter we treat the size of the network TV as a con- 
tinuous variable; our results are asymptotically exact in 
the limit iV — 00. Equation ([2]) reflects that new nodes 
are added to the first generation only when a new node 
links to the root. The first term on the right-hand side 
corresponds to the situation when the root is chosen as 
the target node; the second term accounts for situations 
when a node in the first generation is chosen — then the 
probability that the actual link will be to the root is 1/2; 
the following terms describe situations where the target 
nodes are in higher generations. 

The generalization of ^ to arbitrary generation n is 
straightforward, 



dGn 
dN 



Gi-i 



^ iN 



(3) 



(n-1)! 



(7) 



indicating that the number of nodes decays super- 
exponentially with increasing generation or alternatively, 
depth. Hence, the random ancestor tree is exception- 
ally shallow. For instance, the average depth is finite: 

(") = E„>i '^9n = 2. 

The genealogical structure of the random ancestor tree 
drastically differs from the genealogical structure of other 
growing networks |ll| . In particular, random recursive 



trees 18, which grow by attachment of new nodes 
to random target nodes, have the depth distribution 
Gn{N) = (In iV)"/n!. Thus, there is a sharp peak at 
n-pcak ^ In A/', and furthermore, the maximal depth, de- 
fined as the deepest non-empty generation n„iax, also 
scales logarithmically with system size, n^ax ~ In [ll|. 
This behavior is very robust and occurs in most growing 
trees [20l-[2^ , but there are exceptions where the maximal 
depth grows algebraically with system size [2^ . 

For the random ancestor tree, the generation profile 
([7]) is a sharply decreasing function of depth, and from 
the criterion Gn^^^ ~ 1 we estimate the maximal depth 



IniV 



(8) 



We now introduce the fraction of nodes that belong 



ln(ln7V) ■ 

This depth behavior reflects that the tree is shallow 0] . 

III. MACROSCOPIC HUB 

The random ancestor tree has a macroscopic hub: the 
root is connected to a finite fraction, = 0.367879, 
of all nodes as follows from Q. Such macroscopic hubs 
are found in social networks [14], and have an important 
function in as transportation networks [1. [25l - [28j . Yet, 
with a few exceptions including super-linear preferential 
attachment [1, [ll|, [2^ , most models of network growth 
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have no such hubs and instead, the most central nodes 
have degrees that grow sub-hnearly with network size. 

To find out whether, in addition to the root, there are 
other hubs, let's consider the most connected node at the 
first generation. We denote the number of descendants of 
this node in the second generation by H2 (N) , the number 
of its descendants in the third generation by H3{N), etc. 
The equation governing Hj (N) is identical to (O , 



dHj 
ItN 



E 



iN 



(9) 



with Hi = 1. By definition, the node degree is 1+H2{N). 

Suppose that the most connected node at the first 
generation has macroscopic degree, H2 ~ N. Since 
dHs/dN > H2/{2,N) then iJg ~ as well. In general. 



Hj{N) = UjN 



(10) 



for all j > 2. By substituting (dU]) into ©, we find that 
the coefficients Uj satisfy a recursion relation identical to 



(11) 

Any solution to ([TT|) is a linear combination of the two 
independent solutions: the fast-decaying solution above 



— 1)1 and the slow-decaying one. 
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when j ^ I. For the first few coefficients we have ui = 
and U2 — M3. The boundary condition ui = implies 
that we can not express the solution solely in terms of 
the rapidly decaying solution Uj = l/(j — 1)!. Yet, the 
slowly decaying solution is unphysical because the sum 
Uj diverges. Therefore, the only possible solution is 
the trivial one, Uj = for all j > 1, and we conclude that 
there are no macroscopic hubs in the first generation. 

Similarly, there are no macroscopic hubs in higher gen- 
erations. If there is a hub in the nth generation, the 
recurrence (jlip holds for j > n with the boundary condi- 
tion Un = 0, or equivalently u„+i = Un+2- By repeating 
the steps above, we conclude that all coefficients uj van- 
ish. 



IV. HIGHLY CONNECTED NODES 

In the previous section we have shown that the degree 
of the most connected node in the first generation cannot 
be macroscopic. It is then natural to assume that_the 
degree scales as iV^/'' with /3 > 1, or equivalently [30| 

Hj=H2hj, with H2~CN^/^, (12) 

for all j > 2. The fractions hj satisfy 

i>j 

obtained by substituting (IT2]) into ([9]). Therefore, the 
recursion relation ([B]) becomes 



(14) 



The boundary conditions is 

hi =0, h2 = 1. 



(15) 



Certain features can be determined without solving 
Eqs. (|14p - p3|) . For instance, to obtain the total num- 
ber of descendants ft-tot = Si>2 multiply ([H)) by j 
and sum over all j . This yields 



1 



13-1 



(16) 



Since /itot > 1, we have the bound /3 < 2. 

Similarly, by multiplying (I14p by and summing over 
all j we determine the average generation number 



Ej>2i^j _ 2 

j:,>2hj "2-/3- 



The second order linear difference equation p4|) has 
two independent solutions. We obtain the asymptotic 
behavior of these solutions by converting the difference 
equation (jl4p into a differential equation. If hj de- 
cays sufficiently slow, we may approximate Eq. (|14l) by 
the differential equation dh/dj ~ —(ih/j with solution 
hj ^ at large j. Otherwise, if hj decays very quickly, 
we have hj ~ (3 hj^i/j in the leading order and therefore 
hj ^ /^Vj'- Using this solution as an integrating factor, 
that is, substituting hj = qj (3^ /jl into Eq. (|14|) . we have 
the recurrence qj —qj-i — /3gj+i/(j + l). This recurrence 
reduces to the differential equation dq/dj ~ f3q/j, with 
power-law solution qj ~ j^. Therefore the asymptotic 
behavior of the second solution to Eq. (fT4|) is 



hj 



(17) 



Equations (|T4|) -([T5 |) admit a solution for arbitrary 
value oi (3. In general, such a solution is a linear 
combination of the two solutions above. However, the 
slowly decaying solution is unphysical. If Hj j^^N^^^ 
for J 1, then the depth of the subtree emanating 
from the most connected node in the first generation, 
jmax ^ N^/^ , as follows from the criterion Hj^^^ ^ 1, 
would exceed the maximal depth ([S]). Therefore, we must 
find the special value of (3 for which solution to (|14p - P^ 
is the rapidly decaying solution with the asymptotic be- 
havior p?)) . Therefore, /3 is analogous to an eigenvalue. 

To determine the proper eigenvalue (3 we use the gen- 
erating function technique. We write 



(18) 



J>2 



and convert the difference equation p4p into an integral 
equation 



Jf(z) = 13 / da;Jf(x) + 



Jf(z) 



(19) 
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Note that !K(z) = z^ + z'^ + - • • for small z and in addition, 
Eq. ((Ill) sets = /itot = " 1)"^- 

Given the form of Eq. ([T9l) , we focus on the integral of 
the generating function, ^(z) — dx3-C{x), and convert 
the integral equation (|19p into the differential equation 



+ /3z(l-z)-ij = z2(l-z)-i. 



(20) 



The boundary condition is 3^(0) = 0. We use the inte- 
grating factor I{z) = e~^^(l — z)~^ to simplify (I^Hl) to 
d{yi)/dz = z^{l - z)-^I. Hence, the solution of (gHl is 

J{z) = e^(-i)(l - z)^ dxe^(i-) (^-3^ . (21) 

The behavior of the generating function in the limit 
z ^ 1 reflects the behavior of the coefficients hi in the 



limit j — ^ cxD. Therefore, we evaluate the integral in ([21 
in the limit z ^ 1, 



dxe^'^^-^^ [1 - 2(1 - x) + (1 - xf] (1 - x)-^-^ 
f dye^y[y-'-^ -2y' 



E 



fc! 



1-2 



22/ 



fc=0 

/(/3) + (1 - z)-^ 



1 2-;3 



(1-z) 



L/3 /3-1' 
The function /(/3) is given by the infinite sum 



m - E 



1 



1 



fc=0 



k\ \k~ P fc + l-/3 ' k + 2- P 



We can also express this function in terms of the incom- 
plete Gamma function 7(0,0;) = dtt""^^ e~*, 



/(/?) = 



/3(/3-l) 



(22) 



Using !K = dy/dz and the leading z — ;> 1 asymptotics 
of the integral in ((2T|) we find 



J{(z) 



1 



P-\ 



Pf{P){l- zf-^ as z^l. (23) 



Generally, the generating function has a regular compo- 
nent and a singular one, J{(z) = ^K].cg(z) + J{sing(-z), with 



JCrcg(^) 



1 



+ const X (1 — z), 



:K,i,,s{z)^-PfiP){i-zf-\ 

Since z^j~^ ^ (1 — z)'^~^ when z — ;> 1, we identify the 
singular component with the unphysical solution. Hence, 



the singular term must vanish, and the exponent /? is root 
of the transcendental equation 



/(/?) = 0. 



(24) 



By solving this equation numerically, using the bisection 
method [3l| for example, we find the eigenvalue to essen- 
tially arbitrary precision 

/3 = 1.351746470331... . 

The above behavior extends to higher generations. In 
general, the degree of the most connected node at the nth 
generation grows algebraically with the total number of 
nodes, Hn+i N-^^^", and the exponent /?„ is function 
of the generation index n. Let Hj be the number of 
descendants of the most connected node at generation 
J > n. The coefficients hj, defined by Hj = Hn+ihj, 



again satisfy the recursion ()13p with the boundary con- 
dition hn — and /i„+i = 1. Thus, the coefficients hj 
decay super-exponentially as in (fTTl) . 

The exponents /3„ are obtained by repeating the steps 
above, and we merely quote the final results. First, the 
integral of the generating function is 



J(z) 



/o (1-a;) 
Second, the function /(/3) is now 



fc! ^-^ \m J k + m ~ B„ 

k=0 m=Q ^ ' 



(25) 



(26) 



The values of the exponents /3„ for n < 8 are listed in 
table I. The exponents increase with n and therefore, the 
degree of the most connected nodes declines sharply as 
distance from the root increases. 



n 


/3n 


htot 


(Aj) 





1 


1 


2 


1 


1.351746 


2.8429 


2.0852 


2 


1.682201 


2.9316 


2.1466 


3 


2 


3 


2.1945 


4 


2.309250 


3.0551 


2.2336 


5 


2.612266 


3.1012 


2.2665 


6 


2.910493 


3.1405 


2.2949 


7 


3.204901 


3.1747 


2.3197 


8 


3.496180 


3.2048 


2.3418 



TABLE I: The exponent I3„, the ratio /itot between the to- 
tal number of descendants and the degree, and the average 
distance of a descendant (Aj) versus generation n. The ex- 
ponents Pn are roots of the transcendental equation f{l3„) — 
with /(/3n) given by (|26p . The quantities htot and (Aj) are 
given in equations P5)) and ((2H), respectively. 



The expression ([26)) holds for all n as long as /3„ is 
noninteger. There are, however, two exceptions. We have 



— theory 
»— • simulation 
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FIG. 2: The degree distribution pt versus degree k. The sim- 
ulation results (bullets) are from 10^ independent realizations 
with A'^ = 10® . Also shown is a straight line that corresponds 
to the theoretical result ([T]). 



already seen that for the zeroth generation l3o — 1 and 
the coefheients hj can be determined analytically, hj = 
— 1)!. For the third generation, 133 — 2 and again, 
the coefficients hj have simple form 



3 • 2^- 



J 



(j 1)! 



(27) 



This can be verified by direct substitution into ([T4| . Of 
course, the solution (1271) agrees with the general asymp- 
totic behavior p7|) . 

Using /3„, we calculate several other useful properties. 
First, the ratio /itot between the total number of descen- 
dants and the degree equals 

(28) 



This quantity mildly increases with generation and even- 
tually saturates (table 1). Also, we mention the av- 
erage distance of a descendant (Aj) = (j) — n with 
0) = Ej>„ jhj/ J2j>n ^3- This quantity is given by 

71 - /3„ - 1 



(Aj) = 



(29) 



When n = 3, (Aj) = (e^ - 3)/2. Like /itot, the aver- 
age distance slowly grows with generation but ultimately 
approaches a constant. Hence, statistical properties of 
average quantities such as the total number of descen- 
dants and the distance to a descendant weakly depend 
on depth, but statistical properties of extreme quantities 
such as the degree of the most connected node strongly 
depend on depth. 



V. THE DEGREE DISTRIBUTION 

The degree distribution is an important quantity that 
characterizes local properties of networks. For many 




FIG. 3: The exponent 7„ = 1 -I- /3„ versus n. 



growing networks, the degree distribution satisfies closed 
recurrence equations and consequently, this quantity can 
be determined analytically. It appears impossible to find 
a closed set of equations that govern the degree distri- 
bution for the random ancestor tree. Remarkably, the 
closed set of equations ^ allows us to obtain the tail of 
the degree distribution analytically. 

The power-law degree of the most connected node (fT2|) 
already shows that the degree distribution has power-law 
tail. Let pk be the fraction of all nodes with degree k. As 
announced in ([T]), this distribution decays algebraically. 



Pk - k-'^ with 7 = /? + 1, (30) 

for k ^ 1. Indeed, if we substitute fcmax ^ N^^l^ 
with j3 = j3i into the extreme statistics criterion, 
^^k>k Pk ~ we obtain ([50]) . Here, we take (3 — Pi 
because the exponents /3„ increase monotonically. 

Our numerical simulations results are in excellent 
agreement with the theoretical prediction ([T]). The simu- 
lations are straightforward to implement: with each new 
node, a target node is selected at random, and then, the 
attachment node is selected at random from the target 
node and all of its ancestors. Since the average depth 
remains finite, the computational cost is proportional to 
the network size and we can simulate networks as large as 
N — 10*. We comment that many real- world networks 
have power-law degree distributions [2i] as in (j30p with 
exponent 7 in the range 2 < 7 < 3. For example, for the 
Internet at the Autonomous Systems level, as of 2007, 
the exponent is approximately 7 = 2.5, see [32| . 

Remarkably, the random ancestor tree has a family 
of degree distributions, each characterized by a distinct 
nontrivial exponent. The fraction pn.k of nth generation 
nodes with degree k decays algebraically 



Pn,k^k with 7„ = /3„-|-l. 



(31) 



As in the preferential attachment network, the exponent 
that characterizes the degree distribution increases with 
generation |33|. 
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Prom the roots of the transcendental equation 
= with /(/3„) given by we find that the 

exponent 7„ grows hnearly with n, that is (figure [3]) 



7n 



(32) 



Since = J2n>i 9nPn,k and since (/„ increases with n, 
the dominant contribution to the distribution pk comes 
from first generation nodes. Hence, 7 = 71. This be- 
havior is in contrast with the preferential attachment 
network where the behavior at the average generation 
dominates the degree distribution [ssj . 

For completeness, we mention that the in-component 
distribution mimics the degree distribution. The degree 
of the in-component of a node is defined as the total num- 
ber of its descendants. The quantity htot above is always 
finite and hence, the total number of descendants of the 
most connected nodes is proportional to the degree. As 
consequence, the fraction Is of nodes with in-component 
degree s decays algebraically. Is ~ s~'^ . 



VI. LEAVES, STARS, AND CHAINS 

It is also possible to calculate a few other structural 
properties of the network including the average number 
of leaves as well as the probabilities that the network has 
an extreme structure such as a star or a chain. 

Nodes with no incoming links, or equivalently, nodes 
with degree one are termed leaves. To calculate the to- 
tal number of leaves, we must first find a more detailed 
quantity: Ln{N), the total number of leaves at the jth 
generation. This quantity evolves as follows 



dN 



E 



iN {n + l)N 



(33) 



Each new node is necessarily a leaf and hence, the gain 
term is as in (jSj . The loss term reflects the fact that any 
new link into a leaf decreases the number of leaves by 
one. 

We assume that the number of leaves in each genera- 
tion is macroscopic. 



L„iN)=£„N. 



(34) 



By substituting this form into the rate equation p3l) , we 
find that leaves constitute a finite fraction of all nodes in 
each generation, — 2ii , or equivalently. 



{n + 2)\ 



(35) 



The fraction of leaves grows with depth: 2/3 of all nodes 
in the first generation are leaves; 3/4 of all nodes in the 
second generation are leaves; etc. The total number of 
leaves, Ltot 
we obtain 



is the sum Ltot ~ J2n>i^" ^^'^ from ([35l 



(36) 



As is often the case in complex networks, a large fraction 
of all nodes have no incoming links. 

It is also possible to calculate the likelihood that the 
tree has one of two extreme topologies: (i) a star where all 
nodes link to the root, and (ii) a linear chain of length N. 
Let Sn be the probability that the tree is a star graph. 
This probability satisfies the recursion 



S 



N+l 



3JV 



1 

N 



N -1 
2N 



(37) 



The network is a star graph only if it was previously a 
star. Furthermore, the factor 1/A^ is the probability that 
the root is the target node and the factor {N — 1)/{2N) 
is the probability that the target node is any one of the 
iV— 1 leaves, but the actual link is made to the root. With 
the boundary condition = 5*2 = 1, the probability Sn 
decays exponentially with the network size. 



S 



N 



N2- 



-(N-l) 



(38) 



The likelihood that the tree has a chain topology. Cat, 
obeys the recursion equation Cn+i = Cn/N"^- This re- 
cursion refiects that the newest node must always be se- 
lected both as the target node and as the attachment 
node. The probability for this event is 1/iV^. Using the 
boundary condition C2 = C3 = 1 , the likelihood of grow- 
ing a chain decays extremely rapidly. 



Cn — 



1 



[{N-\W 



(39) 



Therefore, it is far more likely that the random ancestor 
tree is a star than it is a chain. This is another conse- 
quence of the shallow nature of the tree. 



VII. CONCLUSIONS 

We introduced a random structure where the geneal- 
ogy governs the evolution. The random ancestor tree has 
a remarkably rich structure. The network is very tight 
with a sharply-decaying distribution of depth. There is a 
single macroscopic hub that is connected to a finite frac- 
tion of all nodes along with multiple highly connected 
nodes. 

The network is strongly stratified because the ge- 
nealogical structure controls the growth. The most con- 
nected node at distance n from the hub has a sub- 
macroscopic degree that scales as N^^^" with system size 
N. Interestingly, the exponents /3„ are generally tran- 
scendental numbers. Moreover, the exponents /?„ grow 
monotonically with n, and thus, the connectivity sharply 
declines with increasing depth. As a consequence, the 
degree distribution has power-law tail, pk ~ k''^ with 
J = /3i + l and furthermore, the degree distribution varies 
strongly with depth. 

We obtained all of these features analytically using the 
total number of descendants of a highly connected node 
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as a function of generation. This quantity obeys a closed 
set of equations and it allows us to determine many scal- 
ing properties including in particular, the exponent that 
governs the tail of the degree distribution. This theoret- 
ical technique has promise in other growing trees prob- 
lems when the degree distribution does not satisfy closed 
equations [13, [sj] , especially in situations where the dis- 
tribution of depth becomes independent of system size 
asymptotically. 

The random ancestor tree includes no control param- 
eters but can be easily generalized by various modifica- 
tions of the attachment process. More generally, the tar- 
get node can be selected at a rate that is proportional 
to the degree and similarly, the attachment node can 
be determined according to either the generation num- 
ber or the degree. We envision that such generalizations 



can be useful for controlling the degree distribution or 
the number of macroscopic hubs and hence, relevant for 
modeling complex networks. The most challenging gen- 
eralization is the theoretical understanding of ancestor 
networks with cycles. 
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